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Abstract 

The interference channel with degraded message sets (IC-DMS) refers to a communication model 
in which two senders attempt to communicate with their respective receivers simultaneously through a 
common medium, and one of the senders has complete and a priori (non-causal) knowledge about the 
message being transmitted by the other. A coding scheme that collectively has advantages of cooperative 
coding, collaborative coding, and dirty paper coding, is developed for such a channel. With resorting to 
this coding scheme, achievable rate regions of the IC-DMS in both discrete memoryless and Gaussian 
cases are derived, which, in general, include several previously known rate regions. Numerical examples 
for the Gaussian case demonstrate that in the high-interference-gain regime, the derived achievable rate 
regions offer considerable improvements over these existing results. 
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I. Introduction 

The interference channel with degraded message sets (IC-DMS) refers a communication model 
in which two senders attempt to communicate with their respective receivers simultaneously 
through a common medium, and one of the senders has complete and a priori (non-causal) 
knowledge about the message being transmitted by the other. Such a model generically 
characterizes some realistic communication scenarios taking place in cognitive radio channels 
[1], [2] or in wireless sensor networks over a correlated field [3], [4], which we illustrate in 
Figs. [Eta) and Hfo). 

From an information-theoretic perspective, the IC-DMS have been investigated in [l]-[4]. 
Specifically, several achievable rate results have been obtained in [l]-[4], and the capacity regions 
for two special cases have been characterized in [2]-[4]. The main achievable rate region in 
[1] was obtained by incorporating the Gel'fand-Pinsker coding [5] into the well-known coding 
scheme applied to the interference channel (IC) [6], [7]. In this coding scheme, each of the two 
senders splits its message into two sub-messages, and allows its non-pairing receiver to decode 
one of the sub-messages. Knowing the two sub-messages and the corresponding codewords which 
sender 1 wishes to transmit, sender 2 applies the Gel'fand-Pinsker coding to encode its own sub- 
messages by treating the codewords of sender 1 as known interferences. It has been also shown 
in [1, Corollary 2] that, an improved achievable rate region can be attained by time-sharing 
between the early derived rate region and a so called fully-cooperative rate point achieved by 
letting sender 2 use all its power to transmit sender 1' messages. A different coding scheme was 
adopted in [2] and [3], in which neither of the senders splits its message into two sub-messages, 
and receiver 2 does not decode any transmitted information from sender 1 . Since sender 2 knows 
what sender 1 wishes to transmit, sender 2 is allowed to: 1) apply the Gel'fand-Pinsker coding to 
encode its own message; and 2) partially cooperate with sender 1 using superposition coding. It 
has been proven in [2], [3] that, this is the capacity-achieving scheme for the Gaussian IC-DMS 
in the low-interference-gain regime, in which the normalized link gain between sender 2 and 
receiver 1 is less than or equal to 1. 

However, in practice, due to the mobility of the users or random distributions of the sensors, 




Fig. 1. (a) A genie-aided cognitive radio channel [1], in which the Genie informs the cognitive user of what the primary 
user will transmit; (b) A four-node wireless sensor network [3], in which sender 2 senses a larger area such that it knows what 
information sender 1 obtains. 
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Fig. 2. An interference channel with degraded message sets in which sender 2 is close to receiver 1. 



sender 2 may be geographically located near to receiver 1, as illustrated in Fig. [2] It is likely, in 
such a situation, that the Gaussian IC-DMS is in the high-interference-gain regime, in which the 
normalized link gain between sender 2 and receiver 1 is greater than 1. In fact, the findings in 
this paper reveal that the achievable rate region, which was proven to be the capacity region in 
the low-interference-gain regime in [2] and [3], is strictly non-optimal for the Gaussian IC-DMS 
in the high-interference-gain regime. 

In this paper, we develop a new coding scheme for the IC-DMS to improve existing achievable 
rate regions. Our coding scheme differs from one proposed in [2], [3] in the way that, sender 
2 splits its message into two sub-messages, and encodes both sub-messages using Gel'fand- 
Pinsker coding. Moreover, receiver 1 is required to jointly decode the message from sender 1 
and one sub-message from sender 2. With this coding scheme, we derive our main achievable 
rate region for the discrete memoryless case. For comparison purpose, we compromise either 
the coding flexibility (fixing an auxiliary random variable as a constant), or the advantage of 
simultaneous decoding [7], to obtain two subregions of the main achievable rate region. The 
obtained subregions are shown to either include or be the same as the existing ones. We further 
extend the obtained regions from the discrete memoryless case to the Gaussian case, and show 
by numerical examples that our Gaussian achievable rate results strictly improve the existing 
ones in the high-interference-gain regime. 

The rest of the paper is organized as follows. In Section [II], we introduce the channel model 
of the IC-DMS, and the related terminologies. In Section Unl we present the main achievable 
result for the discrete memoryless case with a detailed proof. In Section [IV] we derive two 
subregions of the main achievable rate region, and we show that the derived subregions include 
several existing results as special cases. Lastly, in Section [V] we extend our rate regions from 
the discrete memoryless case to the Gaussian case, and compare them with the existing results. 

Notations: Random variables and their realizations are denoted by upper case letters and lower 
case letters respectively, e.g., X and x. Bold lower (upper) case letters are used to denote vectors 
(matrices), e.g., x and S. Calligraphic fonts are used to denote sets, e.g., X and 71. 

II. The Channel Model 

Consider the IC-DMS (also termed as the genie-aided cognitive radio channel in [1]) 
depicted in Fig. [3j in which sender 1 wishes to transmit a message (or message index), 
Wi G Mi := {!,•••, Mi}, to receiver 1 and sender 2 wishes to transmit its message, 
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Fig. 3. An interference channel with degraded message sets. 



w 2 £ M 2 '■= {I; •••) M 2 }, to receiver 2. Typically, this discrete memoryless IC-DMS is described 
by a tuple (X\, X 2 , 3^i, 3^,p(2/i, 2/2 l^i, ^2)), where X\ and are the channel input alphabets, 3^1 
and y 2 are the channel output alphabets, and p(yi, y 2 \xi , x 2 ) denotes the conditional probability 
of y 2 ) G x 3^ given (xi, x 2 ) £ X x x X 2 . The channel is discrete memoryless in the sense 
that 

P(Vl,t, V2,t\ x l,t, %2,t, Xl,t-1, %2,t-l, •••) = P(Vl,t, V2,t\Xl,t, X 2 ,t), (l) 

for every discrete time instant t in a synchronous transmission. In terms of the channel input- 
output relationship, the IC-DMS is the same as the IC. However, in the IC-DMS, sender 2 is 
able to noncausally obtain the knowledge of the message w±, which will be transmitted from 
sender 1. This is the key difference between the IC-DMS and IC in terms of the information 
flow. We next present the following standard definitions with regard to the existence of codes 
and achievable rates for the discrete memoryless IC-DMS channel. 

Definition 1: An (M ly M 2 ,n, P e ) code exists for the discrete memoryless IC-DMS, if and 
only if there exist two encoding functions 

h : Mi - X?, f 2 :M 1 xM 2 ^ X 2 \ 

and two decoding functions 

gi-W^Mx, 92-.y^M 2 , 

such that max{Pg"\ P^} < Pe, where P e Y and P^ 2 denote the respective average probabilities 
of error at decoders 1 and 2, and are computed as 

p ti = jfjf p^ 1 ^ w i\( w i> w *) were sent )' 

P S = jjjf Yl p ^ 2 ^ ^lOi, w 2 ) were sent). 
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Definition 2: A non-negative rate pair (R 1: R 2 ) is achievable for the IC-DMS, if for any given 
< P e < 1 and any sufficiently large n, there exists a (2 nRl ,2 nR2 ,n, P e ) code for the channel. 
The capacity region of the IC-DMS is the set of all the achievable rate pairs for the channel, 
and an achievable rate region is a subset of the capacity region. 

It should be noted that from an information-theoretic standpoint, the IC can not be simply 
treated as a special case of the IC-DMS in the sense that the capacity region of the IC-DMS, if 
any, does not imply a capacity region of the IC. 

III. An Achievable Rate Region for the Discrete Memoryless IC-DMS 

In this section, we present the main achievable rate region for the discrete memoryless IC- 
DMS, which is the primary result in this paper. 

Consider auxiliary random variables W, U, U, V, V and a time-sharing random variable Q, 
defined on arbitrary finite sets W, U, U, V, V and Q respectively. Let V denote the set of all 
joint probability distributions p(-) that factor in the form of 

p(q, w, x u u, u, v, v, x 2 , yi, y 2 ) =p(q)p(w, x x \q)p{u, u\w, q)p(v, v\w, q) 

■ p(x 2 \u, v, w, q)p(yi, y 2 \xii x 2 ), (2) 

where w, u, u, v, v, and q are realizations of random variables W, U, U, V, V and Q. 

Let lZ(p) denote the set of all non-negative rate pairs (Ri, R 2 ) such that the following 
inequalities hold simultaneously 

Hi</(W;W|Q), (3) 

Ri < I(UV; Y 2 \Q) - I{U- W\Q) - I(V; W\Q), (4) 

Ri + R 2 < I(UW] Y X \Q) + I(V; Y 2 U\Q) - I(U; W\Q) - I(V; W\Q); (5) 

0< IiUWMlQ) -I(U; W\Q), (6) 

0<I(U;Y 2 V\Q)-I(U;W\Q), (7) 

0<I(V;Y 2 U\Q)-I(V;W\Q), (8) 

< I(UV; Y 2 \Q) - I(U; W\Q) - I(V; W\Q), (9) 

for a given joint distribution p(-) E V. 

Let C denote the capacity region of the discrete memoryless IC-DMS, and let 

n= |J nip). 

Theorem 1: The region n is an achievable rate region for the discrete memoryless IC-DMS, 
i.e., n C C. 

Proof: Before presenting the proof of Theorem Q3 we state the following lemma as it will 
be frequently used in the proof. 

Lemma 1 ( [8, Theorem 14.2.3]): Let denote the typical set for the probability mass 
distribution p(s ll s 2 , s 3 ), and let P(S[ = s 1; = s 2 , S' 3 = s 3 ) = ]J? =1 p(sii\s 3i )p(s 2i \s 3i )p(s 3i ), 
then P{(S' 1 ,S / 2 ,S 3 ) G Ai n) } = 2-»(^i;^|sS)±eO. 

To prove this theorem we apply the notion of the asymptotic equipartition property (APE) 
[8]. Our coding scheme is mainly based on the arguments of superposition coding [9] and 
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Gel'fand-Pinsker coding [5]. Specifically, sender 1 independently encodes its message W\ as a 
whole; while sender 2 needs split its message into two parts, i.e., w 2 = (w 2 i,w 2 2), and encode 
them separately. Both u> 2 i and w 22 are encoded using the Gel'fand-Pinsker approach, but they are 
processed differently at the receivers. The message w 22 will be decoded by receiver 2 only, while 
w 2 i will be decoded by both receivers. Moreover, knowing the message and codeword which 
sender 1 is going to transmit, sender 2 not only can apply Gel'fand-Pinsker coding to deal with 
the known interference, but also can cooperate with sender 1 to transmit w\ using superposition 
coding. Let R 21 and R 22 denote the rates of w 2 i and w 22 respectively, i.e., w 2 i G {1, . . . , 2 nR21 } 
and w 22 G {1, . . . , 2 nR22 }. If receiver 1 can decode w\ and receiver 2 can decode both w 2 ± and 
w 22 with vanishing probabilities of error, then (R 1 , R 2 i + R 22 ) is an achievable rate pair for the 
IC-DMS. 

To prove that the entire region 1Z is achievable for the channel, it is sufficient to prove that 
1Z(p) is achievable for a fixed joint probability distribution p(-) G V. 



A. Random Codebook Generation 

Consider a fixed joint distribution p(-) G V, and a random time-sharing codeword q of length 
n, which is given to both senders and receivers. The codeword q is assumed to be generated 
according to YY* =1 p(qi). 

Generate 2 nRl independent codewords w(j), j G {1, . . . ,2 ni?1 }, according to Yli=iP( w i\ ( li) : > 
and for each w(j) generate one Xi(j), according to Yl*=iP( x ii\ w iQi)- Similarly, generate 2 nR21 
independent codewords u(/i), li G {1, . . . , 2 nR21 }, according to n^iM^kO' anc ^ generate 2 nR22 
independent codewords v(/ 2 ), l 2 G {1, . . . , 2 ni?22 }, according to YYi=iP( v Mi)- 

For each codeword pair (u(Zx), w(j)), generate one codeword u(7i,j) according 
to YYl =1 p{ui\ui{l\)wi{j)qi), and similarly for each codeword pair (v(Z 2 ), w(j)), gen- 
erate one codeword v(/ 2 ,j) according to \Xi=iP{^i\ v i{h)wi{j)qi)- Lastly, for each 
codeword triple (u(7i), v(/ 2 ), w(j)), generate one codeword x 2 (h,l 2 ,j) according to 
H^=iP( x 2i\ui(h)vi(k)wi(j)qi). 

Now uniformly distribute 2 nR ' 21 codewords u(ii) into 2 ni?21 bins indexed by k\ G 
{1, . . . , 2 nR21 } such that each bin contains 2 n ( R21 ~ R21 * > codewords; uniformly distribute 2 nR22 
codewords v(/ 2 ) into 2 nR22 bins indexed by k 2 G {1, . . . , 2 nR22 } such that each bin contains 
2 n(R 2 2-R22) codewords. 

The entire codebook is revealed to both senders and receivers. 



B. Encoding and Transmission 

We assume that the senders want to transmit a message vector (wi,w 2 i,w 22 ) = (j,ki,k 2 ). 
Sender 1 simply encodes the message as codeword Xi(j') and sends the codeword with n channel 
uses. Sender 2 will first need to look for a codeword u(/i) in bin k\ such that (u(ti), w(j), q) G 
A^ n \ and a codeword v(/ 2 ) in bin k 2 such that (v(/ 2 ), w(j), q) G Ae ■ If sender 2 fails to 
do so, it will randomly pick a codeword u(/i) from bin k\ or a codeword v(/ 2 ) from bin k 2 . 
Sender 2 then transmits codeword x 2 (7i,/ 2 ,j) through n channel uses. We further assume that 
the transmissions are perfectly synchronized. 
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C. Decoding 

Receiver 1 first looks for all the index pairs (jji) such that (w(j), u(Zi), yi, q) £ A e . If j 
in all the index pairs found are the same, receiver 1 determines W\ = j, otherwise declares an 
error. 

Receiver 2 will first look for all index pairs (fi,£ 2 ) such that (u(fi), v(/ 2 ), y 2 , q) £ A^ 1 ' . If li 
in all the index pairs found are indices of codewords from the same bin with index k\, 

and l 2 in all the index pairs found are indices of codewords v(Z 2 ) from the same bin with index 
k 2 , then receiver 2 will decode that (^21,^22) = (ki, k 2 ) were transmitted; otherwise, an error 
is declared. 



D. Evaluation of Probability of Error 

We now derive upper bounds for the probabilities of the respective error events, which may 
happen during the encoding and decoding process. Due to the symmetry of the codebook 
generation and encoding processing, the probability of error is not codeword dependent. Without 
loss of generality, we assume that (wi, w 2 i, W22) = (1, 1, 1) were encoded and transmitted. We 
next define the following three types of events: 

£ ,i = (u(a),w(l)),q)6iW 

E a , b = (w(a),u(b), yi ,q)eA^, 

4, 6 = (u(a),v(&),y 2 ,q)eAW. 

Let P e (enc2), P e (decl), and P e (dec2) denote the probabilities of error at the encoder of sender 
2, the decoder of receiver 1, and the decoder of receiver 2, respectively. 

[Evaluation of P e (enc2).] An error is made if 1) the encoder at sender 2 can not find u(Zi) 
in bin 1 such that (u(/x), w(l), q) £ Ai n \ and/or 2) it can not find v(/ 2 ) in bin 1 such that 
(v(7 2 ), w(l), q) £ Ai \ Then the probability of error at the encoder of sender 2 is bounded as 

P e (enc2)<Prl f| (u^), w(l), q) i A™ j + Pr I f| (v(f 2 ), w(l), q) £ A^ 

\u(fi)ebin 1 / \v(f 2 )ebin 1 

= n p r( E u+ n p < E tj 

u(fi)ebin 1 v(i 2 )£bin 1 

< (1 - Pr(E Ll )r^- R - } + (1 - PriE^r^-^ 

< /j _ 2-ra(/(C/;TV|Q)+6e)\2 Tl ^2i-«2l) _|_ /j _ 2-n(7(F;W|Q)+6e)\2"(' fi 22-«22) 

where (a) follows from the fact that we can obtain Pr(E~ h J > 2~ n ^ I ^ u ' w ^ +6e '> and Pr(P, 2 J > 

2 -n(I(V;W\Q)+6e) by settmg g/ = > g^ = ^ md g^ = ^ ^ g/ = y g, = ^ and g^ = q 

in Lemma [H respectively. Following the same argument in the proof of Lemma 2.1.3 of [10], 
we conclude that P e (enc2) — > as n — > +oo, if 

Rn>R*i + I(U)W\Q), (10) 
P22 >Rn + I(V;W\Q), (11) 
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are satisfied. We further choose 

R 21 = R 21 + I(U;W\Q), (12) 
R 22 = R 22 + I(V;W\Q). (13) 

Note that such a choice still ensures that P e (enc2) — > as n — •> +oo. 

[Evaluation of P e (decl)] An error is made if 1) E c ~ happens, and/or 2) there exists some j ^ 1 
such that E. s happens. Note that l\ is not required to be equal to / l5 since it is unnecessary for 
receiver 1 to decode l\ correctly. The probability of error at receiver 1 can be upper bounded as 

P e (decl) <Pr{El k Uu iyi %) 

< Pr{E c lk ) + 2 nRl Pr(E 2k ) + 2 n ( /?1+ ^ 21 )Pr(P 2 * (14) 
Choosing S[ = W, S' 2 = (Y 1; U), and S' 3 = Q in Lemma [TJ we have Pr(E 2 jJ = 

2 -n(I(W;Y 1 U\Q)±6e) Likewise we have p r ( E - ) = 2 - n ( I (WU;Y 1 \Q)±6e) j n addition it foUowS 

from AEP that Pr(P c f ) — > as n — > +oo. Thus, we infer from GU) that P e (decl) — > as 
n — > +oo, if 

Pi</(^;Yi£/|Q), (15) 
Pi + P 2 i< /(W-^IQ), (16) 

are satisfied. 

[Evaluation of P e (dec2)] An error is made if 1) Ef - happens, and/or 2) there exists some 

(h, l 2 ) in which either l\ or l 2 is not an index of any codeword from the respective bin 1. The 
probability of the second case is upper bounded by the probability of the event, E~ ~ for some 

_ A 1, 2 

(h, h) 7^ h)- Thus, the probability of error at receiver 2 is bounded as 

P.(d«2)<ft-(^ A U u AAHKtAAA) 

<Pr(E? fc )+ £ P(J5,=) 

(fi,i S a)/(ii,Ja) 

k+k (h+kk+k) 
< Pr(Ef , ) + 2 nR21 Pr(Ej , f , ) + 2 nR22 Pr(E f f - ) + 2^ 21+ ^Pr(P, , f * - ). 

(17) 
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Applying Lemma \T\ to evaluate Pr(E~ - ), Pr(E~ , . ) and Pr(E- - s . ) in (flTI) . we 

tlT^l)^ ^1^27=^2 ^17^1^27=^2 

conclude that P e (dec2) — > as n — > +oo if the following inequalities, 

^ 2 i <I(C/; WIQ), (18) 
^ 22 </(^;W|g), (19) 

^2i + ^22</(c/^ ; y 2 |g), (20) 

are satisfied. 

According to (fl"2]) . (fl3l) and the fact that R 2 = R21 + R22, we first substitute -R21 and -R22 with 
R 21 + I(U; W\Q) and R 22 + I(V; W\Q) in (Q2]>, ([16]) and (fl8T)-(l20l). and subsequently substitute 
R 21 with R 2 — R 22 in the resulting inequalities. After these two substitution steps, we have 

Ri <I(W',Y X U\Q), (21) 

R x + R 2 - R 22 < I(WU; Yi\Q) - I(U; W\Q), (22) 

# 2 - R22 < I(U; Y 2 V\Q) - I(U; W\Q), (23) 

#22 < I(V; Y 2 U\Q) - I(V; W\Q), (24) 

R2 < I(UV; Y 2 \Q) - (I(U; W\Q) + I(V; W\Q)). (25) 

Furthermore, applying Fourier- Motzkin elimination [11] to remove i? 2 2 from (|2"T])-(|251). we have 

Rx<I{W\Y x U\Q), (26) 

R2 < I(UV; Y 2 \Q) - {I{U; W\Q) + I(V; W\Q)), (27) 

R 2 < I(U; Y 2 V\Q) - I(U; W\Q) + I(V; Y 2 U\Q) - I(V; W\Q), (28) 

Ri + R 2 < I(WU; Y^Q) - I(U; W\Q) + I{V; Y 2 U\Q) - I{V; W\Q). (29) 

Since I(U; Y 2 V\Q) + I(V; Y 2 U\Q)-I{UV; Y 2 \Q) = I(U; V\Q)+I(U; V\Y 2 Q) > 0, ([27]) implies 
([28]) and thus ([28]) is redundant. To ensure that R\, R 2 \ and -R22 are non-negative, we enforce 
four additional constraints ©-([9]). Therefore, the rate region 1Z(p) is achievable for a fixed joint 
probability distribution p(-) E V, and Theorem \T\ follows. ■ 
Remark 1: The proposed coding scheme exploits three coding methods to achieve any rate 
pair in the rate region, 1Z. The first method is cooperation that is realized by the superposition 
relationship between w and x 2 through p(x 2 \u 2 , v 2 , w, q). The second is collaboration, by which 
we mean that sender 2 separates its own message into two parts, i.e., w 2 = (w 2 i,w 22 ), and 
encodes w 2 i at a possibly low rate such that receiver 1 can decode it. By doing so, the effective 
interference caused by the signals carrying the sender 2's information may be reduced. The third 
is Gel'fand-Pinsker coding, which we apply to encode both messages, w 2 \ and w 22 , from sender 
2 by treating the codeword w as known interference. This perhaps allows receiver 2 to be able 
to decode the messages from sender 2 at the same rate as if the interference caused by sender 
1 was not present [12]. 

IV. Relating with Existing Rate Regions 

In this section, we will show that Theorem \T\ includes the achievable rate regions in [2], [3]. 
To demonstrate it, we compromise the advantages of the coding scheme developed in Section 
Unl to obtain the following subregions of 1Z. 
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A. A Subregion oflZ 

Let V* denote the set of all joint probability distributions p(-) that factors in the form of 

p(q, w, zi, u, v, v, x 2 , yi, 2/2) =p(q)p(x u w\q)p(u\q)p(v, v\w, q) 

■ p(x 2 \u, v, w, q)p(yi, y 2 |zi, x 2 ). (30) 

Note that the joint distribution (l30l) differs from © in the way that conditioned on Q, U is now 
independent of any other auxiliary random variables, and U is not present. 
Let TZ s i m (p) denote the set of all non-negative rate pairs (Rx,R 2 ) such that 

Ri < IiW-^lUQ), (31) 

R 2 < I(UV; Y 2 \Q) - I{V- W\Q), (32) 

Ri + R 2 < I{WU; YilQ) + I(V; Y 2 \UQ) - I(V; W\Q)\ (33) 

0< Iiy-^UQ)- I(V;W\Q), (34) 

for a joint probability distribution p(-) E V* . Furthermore, let 

P (-)ev* 

Theorem 2: The rate region TZ^ m is achievable for the discrete memoryless IC-DMS, i.e., 
ftsim CTZCC. 

Proof: The proof can be devised from the proof of Theorem \T\ by customizing the original 
coding scheme for the new joint distribution (l30l) . We change the encoding and decoding method 
for the message w 2 \ (corresponding to U), i.e., the Gel'fand-Pinsker coding used in the proof 
of Theorem Q] was replaced by the conventional random coding. Specifically, we generate 2 nR21 
independent codewords u(&i), ki E {1, . . . , 2 nR21 }, according to Y[i=iP( u i\li)- The encoding 
and decoding are then adapted to the new codebook accordingly. Evaluating the probability of 
error in the same way as was done in the proof of Theorem [Q we obtain 



R22-R22>I(V;W\Q); (35) 

Rx^nW-^UQ), (36) 

Ri + R2i<I(WU;Y 1 \Q); (37) 

R2i<I(U;Y 2 \VQ), (38) 

R 22 <I(V;Y 2 \UQ), (39) 

#21 + £22 <I{UV-Y 2 \Q). (40) 



Again, we choose R22 — R22 = I(V; W\Q) in (1351) . and then substitute R 22 with R 22 + 
I(V;Y 2 \UQ) as well as R 2 ± with R 2 — R 22 in the group of (I36l)-(l40l). By applying Fourier- 
Motzkin elimination on the resulting inequalities to remove R 22 , and adding the constraints that 
ensure the respective rates R\, i? 2 i and R 22 to be non-negative, we obtain (l3TT)-(l34l). Therefore, 
the region R s i m (p) is achievable for a given p(-) E V*, and the theorem follows. ■ 
Note that simultaneous decoding (simultaneous joint typicality) is applied at both decoders. 
The advantage of simultaneous decoding over successive decoding is well demonstrated on the 
IC by Han and Kobayashi in [7]. We next modify the coding scheme by applying successive 
decoding instead of simultaneous decoding at both decoders to derive a subregion of 1Z sim . 
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B. A Subregion oflZ sim 

Let lZ suc (p) denote the set of all achievable rate pairs (Ri,R 2 ) such that 

i?i</(^;Yi|C/Q), (41) 
R 2 < mm{I(U; Y ± \Q), I(U; Y 2 \Q)} + I(V; Y 2 \UQ) - I(V; W\Q); (42) 

o< j(y ; yi|?/g)-/(y;^|g), (43) 

for a fixed joint probability distribution p(-) E V*. Define 

Theorem 3: The rate region 1Z SUC is achievable for the discrete memoryless IC-DMS, i.e., 

Proof: The codebook generation, encoding and transmission remain the same as those used 
to prove Theorem [2} whereas the decoding processes at both decoders are altered. Both decoders 
decode w 2 \ first, and then decoder 1 decodes w\ and decoder 2 decodes w 22 respectively. Then 
the following can easily be obtained 

R22-R22>I(V;W\Q), (44) 

Rxl<I(U;Y x \Q), (45) 

Rx^nW-^lUQ), (46) 

Rzi<I(U;Y 2 \Q) t (47) 

R 22 <I(V;Y 2 \UQ). (48) 

From (|44)) -(|48T). it is straightforward to obtain (I4T)) -(|4"31. Therefore, the region 1Z suc (p) is 
achievable, and the theorem follows immediately. ■ 
Remark 2: Note that (l45l) is only necessary when the successive decoding is applied. This is 
because every decoding step in a successive decoding scheme is expected to have a vanishing 
probability of error. 

In what follows, we further specialize the subregion TZ SUC to obtain two more achievable rate 
regions TZ sp \ and 1Z sp 2- Let V* denote the set of all joint probability density distributions p(-) 
that factor in the form of 

p(q, w, xi, v, v, x 2 , y 2 ) = p{q)p{xi, w\q)p(v, v\w, q)p(x 2 \v, w, q)p{yi, y2\xi,x 2 ). (49) 

Let lZs P i(p) denote the set of all non-negative rate pairs (Ri,R 2 ) such that 

Rx<I{W-Y x \Q), (50) 

R 2 <I(V;Y 2 \Q)-I(V;W\Q), (51) 

for a fixed joint distribution p(-) E V\. Define 

"K-spi = |J ^ sp i(p). 

p(-)evf 

Corollary 1: The region !Z spl is an achievable rate region for the discrete memoryless IC- 
DMS, i.e., 7e spl c 7e suc c n sim cncc. 
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Proof: Fixing the auxiliary random variable U as a constant, we reduce (|41"I) and (l42l) to 
(l50l) and (|5"TT) . and the corollary follows immediately. ■ 

Remark 3: We note that the region 7£ sp i is similar to the region 7?.i n reported in [3, Theorem 
3.1]. It seems that the region !Z[ n is more general than the region 1Z spl in the sense that fixing 
the auxiliary random variable U in lZ m as a constant, one can obtain a region which is the 
same as TZ sp \. Nevertheless, after examining the coding scheme used in [3, Theorem 3.1], one 
can find that there exists a one-one correspondence between codewords u(w 2 ) and x 2 (w 2 ), and 
both codewords are jointly generated and decoded, i.e., p(u,x 2 ) is used to generate two-letter 
codewords. Thus, one can introduce one auxiliary random variable W such that there exists a 
one-one mapping between W and U x X 2 , i-e., / : U x X 2 <-> W, and thus W has the probability 
mass distribution p(w) = p{f~ l {w)) = p(u,x 2 ). Replacing all (Xg(w 2 ), U n (w 2 )) by W n (w 2 ) 
in the proof of [3, Theorem 3.1] will yield the same rate region. Equivalently speaking, for 
any input distribution achieving a rate region characterized by [3, Theorem 3.1], one can find 
a corresponding joint distribution in the form of (|49l) such that Corollary \T\ yields exactly the 
same rate region. Therefore, two rate regions lZ sp i and lZ[ n are identical. 

Let V 2 denote the set of all joint probability distributions p(-) that factor in the form of 

p(q, w, xx, u, x 2 , yx, y 2 ) = p(q)p(x 1 , w\q)p(u\q)p(x 2 \u, w, q)p(yx, 2/2^1, x 2 ). (52) 

Let 7?. S p2(p) denote the set of all non-negative rate pairs (Rx,R 2 ) such that 

Rx<I(W;Yx\UQ), (53) 
R 2 <mm{I(U;Yx\Q),I(U;Y 2 \Q)}, (54) 

for a fixed joint distribution p(-) E V 2 . Define 

^•sp2 = (J ^- SP 2(P)- 

p(-)eP 2 * 

Corollary 2: The region 1Z sp 2 is an achievable rate region for the discrete memoryless IC- 
DMS, i.e., TZ sp2 C ^ suc C TZ sim CUCC. 

Proof: The proof can be devised from the proof of Theorem [3] easily by fixing V as a 
constant. ■ 

V. The Gaussian IC-DMS 

In the preceding sections, we have derived several achievable rate regions for the discrete 
memoryless IC-DMS. We now extend these results to obtain corresponding achievable rate 
regions for the Gaussian IC-DMS (GIC-DMS). 

A. The Channel Model of the GIC-DMS 

In general, with no loss of information-theoretic optimality, the GIC-DMS can be converted 
to the GIC-DMS in the standard form through invertible transformations [2], [6], [11]. We thus 
only consider the GIC-DMS in the standard form, which is represented as follows 

Y X = X 1 + yfc[X 2 + Z X , 
Y 2 = X 2 + y/ C X2 Xi + Z 2 , 
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Fig. 4. A Gaussian interference channel with degraded message sets. 



where Zj, i — 1, 2, is the additive white Gaussian noise with zero mean and unit variance, and 
y/c2\ and y/cyi are the normalized link gains in the GIC-DMS depicted in Fig. |4] Moreover, 
the transmitted codeword Xj = (xn, . . . , Xi„), i — 1, 2, is subject to an average power constraint 
given by 

xdl 2 < Pi- 

Since it has been shown in the maximum-entropy theorem in [8] that Gaussian input signals are 
optimal for Gaussian channels, we will consider Gaussian codewords Xf, i — 1,2. 

B. Achievable Rate Regions for the GIC-DMS 

1) Gaussian Extension of 1Z: We first extend 1Z to its Gaussian counterpart denoted by Q. 
To obtain the rate region Q, we map the random variables involved in the joint distribution © 
to the corresponding Gaussian random variables with the following customary constraints: 
PI) W, distributed according to M(0, 1), 
P2) Xi = VKw, 

P3) U, distributed according to Af(0,af3P 2 ), 

P4) V, distributed according to J\f (Q, a/3P 2 ), 

P5) U = U+ X X W, 

P6) V = V + X 2 W, 

P7) x 2 = U + V + ^fbxP 2 W, 

where a, (3 e [0, 1], a + a = 1, /5 + j3 — 1, Ai, A 2 G [0, +oo), and U and are mutually 
independent. The input-output relationship of the GIC-DMS can be described by 

Yi = (a/A + V / C2i«P 2 ) ^ + + v 7 ^^ + (55) 

Y 2 = U + V- + (v^+ ^12^1) W + Z 2 . (56) 

To simplify the derivations, we fix the time-sharing random variable Q as a constant. The issue 
of how this time-sharing random variable affects the achievable rate region is well addressed 
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in [13]. In the Gaussian case, the respective mutual information terms in ([3]) - © need be 
evaluated with respect to the mappings defined by P1-P7. Since the computation procedure to 
obtain Q and the resulting description of Q are fairly lengthy, we relegate them (Theorem 5) to 
the Appendix. 

2) Gaussian Extension oflZ suc : For illustration and comparison purpose, we next show how 
to obtain the Gaussian counterpart of 1Z SUC in details. Following the first step in the previous 
derivation, we also map the random variables involved in (|3Q|) to the Gaussian ones with the 
following constraints: 

Ml) W, distributed according to Af(0, 1), 
M2) X 1 = sfP x W, 

M3) U, distributed according to J\f(0,a(3P 2 ), 
M4) V, distributed according to J\f(o,ai3P 2 ), 
M5) V = V + XW, 
M6) X 2 = U + V + yf&P 2 W, 

where a,0 G [0, 1], a + a = 1, (3 + ft = 1, A G [0, +oo), and W, U and V are mutually 
independent. Using the mappings defined by M1-M6, we express the input-output relationship 
for the GIC-DMS as: 



Yi = U/P\ + v / c 2 i«P 2 ) W + ^2~iU + + Z u (57) 

Y 2 = U + V+ (y&P 2 + V^pT) W + Z 2 . (58) 
Let Qsuc(ce,P) denote the set of all the non-negative rate pairs (Ri,R 2 ) such that 




(59) 



c 2 ia(3P 2 



(y/K + \/c 2 iaP 2 ) + c 2 iaf3P 2 + 1 
apPi 

pp 2 + (V^p 2 + V^Pl) 2 + 



(60) 



Define 



£suc = (J 08oc 
a,/3e[0,l] 



Theorem 4: The region ^ suc is an achievable rate region for the GIC-DMS in the standard 
form. 

Proof: It suffices to prove that Q S uc(ce,P) is achievable for any given a,/3 £ [0, 1]. Since 
C? suc is extended from !Z mc , we need compute the mutual information terms in (141) and (|42[). 
The righthand side of ( [59b can be readily obtained through a straightforward computation of 
I(W; Yi\UQ) in (|4TI) . Recall that Q is a constant. It is also fairly straightforward to obtain the 
two terms within the minimum operator in (|60l) through computing I(U; Yi\Q) and I(U; Y 2 \Q) 
in (|42)) . We next evaluate the only remaining term I(V; Y 2 \UQ) — I(V; W\Q) for a constant Q. 
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Defining Y 2 = V + (\/aP 2 + y/cwPi) W + Z 2 , we have 

I(V; Y 2 \U) - I(V; W) = h{Y 2 \U) - h{Y 2 \UV) - I{V; W) 

= h(Y 2 )-h(Y 2 \V)-I(V; W) 
= h{Y 2 ) + h(V) - h(Y 2 V) - I(V; W). 
With V = V + XW, we evaluate dSB as 
I(V;Y 2 \U)-I(V; W) 



(61) 



=-log 2 (2vre(a/3P 2 



--log 2 ( (2vre) 2 



olP, 



'c 12 Pi 



-hg 2 (2ire(apP 2 + \ 2 )) 



apP 2 



'c 12 Pi 



- [apP 2 + A (v^+ V^Pi 
It is easy to find that when 



A 2 



o lQ g2 1 + 



apP 2 



A 



otPP 2 (v^P + v^P) 



(62) 



(63) 



(64) 



apP 2 + 1 

the term I(V; Y\U) — I(V; W) is maximized, and the maximum value is 

max[J(F; Y 2 \U) - I(V; W)\ = \ log 2 (l + e^P 2 ). 

This is in parallel with the result in [12]. 

Therefore, the rate region £ suc (a,/3) is achievable for any pair a, (3 G [0, 1], and the theorem 
follows. ■ 

In the following, we obtain two corollaries by setting (5 — and j3 — 1 in Theorem HI 
respectively. 

Corollary 3: The rate region £? spl is an achievable rate region for the GIC-DMS in the standard 
form with Q spi := |Jae[o i] ^suc(a, 0), i.e., Q spl is the union of the sets of non-negative rate pairs 
(Pi, R 2 ) satisfying 



Pi < 77 !og 2 1 + p i 1 

2 \ c 2i aP 2 + 1 y 



P 2 < -log 2 (l + «P 2 ), 

over all a G [0, 1]. 

Corollary 4: The rate region ^ sp2 is an achievable rate region for the GIC-DMS in the standard 
form with Q sp2 '■= Uqg[o i] ^suc(a, 1), i-e-, Qs P 2 is the union of the sets of non-negative rate pairs 
(Pi, P 2 ) satisfying 



Pi <^ log 2 (\ + (v 7 ^ + ^laPa) 2 ) , 



P 2 < min < - log 2 1 



c 2 ±aP 2 



1 



log 2 1 



aPo 



(yfaF 2 + Vci 2 Pi) J 
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Fig. 5. Pi = P2 = 6, C21 = 0.3, C12 = 0. (i) gives the rate region in Theorem 1 of [1]; (ii) gives the rate region in Corollary 
2 of [1]; (iii) gives the rate region in Corollary 3 (equivalently, Theorem 4.1 of [2] and Theorem 3.5 of [3]). 



over all a G [0, 1]. 

Remark 4: Corollaries [3] and @] correspond the Gaussian extensions of Corollaries \T\ and |2] 
respectively. Particularly, the rate region depicted by Corollary |3] is the same as the rate regions 
given in [2, Theorem 4.1] and [3, Theorem 3.5]. It has been proven in both [2] and [3] that 
the rate region Q sp i is indeed the capacity region for the GIC-DMS in the low-interference-gain 
regime, i.e., c 2 i < 1. 

In addition, the set of achievable rate pairs given in [2, Lemma 4.2] is contained in the region 
Q sp 2 as a subset. 

C. Numerical Examples 

We next provide several numerical examples to illustrate improvements of our achievable rate 
regions over the previously known results in [l]-[3]. Denote the achievable rate regions obtained 
in [1, Theorem 1] and [1, Corollary2] by £ d mti and Gdmt2, respectively. 

1) Comparing with Rate Regions in [1]: Fig. [5] compares the rate regions Gdmti, Gdmt2, and 
C? sp i for an extreme case in which receiver 2 does not experience any interference from sender 
1, i.e., C12 = 0. As can be seen from Fig. [5j the rate region Q spl strictly includes <?dmti> as well 
as £dmt2 obtained through time-sharing between Q dmtl and a fully-cooperative rate point. The 
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coding scheme used to establish (? dmtl incurs certain rate loss due to the fact that sender 2 does 
not use its power to help the sender l's transmissions even though it has complete and non- 
causal knowledge about the message being transmitted by sender 1. In contrast, our proposed 
coding scheme allows sender 2 to use superposition coding to help sender 1, and thus yields an 
improved rate region. 

In Fig. [6[ we consider another case in which the transmit power of sender 1 is set to zero and 
c 2 i < 1. From the figure, we observe that the rate region ^ dmt2 is strictly smaller than Q spl . Note 
that in this case, the GIC-DMS becomes a Gaussian degraded broadcast channel. According to 
[8], the optimal coding scheme for this case is: sender 2 uses a portion of its power to transmit 
the codeword conveying w±, and uses the remaining power to transmit the codeword conveying 
1D2, which is encoded by using the dirty-paper coding [14]. It is easy to verify that this scheme 
is a special case of the coding scheme developed in Theorem [T] 



1.5 




Fig. 6. Pi = 0, P2 = 6, C21 = 0.5. (i) gives the rate region in Corollary 2 of [1]; (ii) gives the rate region in Corollary 3 
(equivalently, Theorem 4.1 of [2] and Theorem 3.5 of [3]). 

2) Comparing with Rate Regions in [2], [3]: As mentioned earlier, the rate region Q spl , a 
subregion of Q, is the same as the one given in [2, Theorem 4.1] and the one given in [3, 
Theorem 3.5], which is indeed the capacity region for GIC-DMS in the low-interference-gain 
regime. In Figs. [7J and [8] we compare Q with Q spl and Q sp2 in the high-interference-gain regime, 
i.e., c 2 i > 1. As can be seen from the figures, the rate region Q strictly includes both C? spl and 
Q sp2 in this case. Comparing Fig. [7J with Fig. [8] we observe that the improvement of the rate 
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Fig. 7. Pi = Pi = 6, C21 = 2, C12 = 0.3. (i) gives the rate region in Corollary 3 (equivalently, Theorem 4.1 of [2] and 
Theorem 3.5 of [3]); (ii) gives the achievable rate region in Corollary 4; (iii) gives the achievable rate region in Theorem 5. 



region Q over £ spl becomes more pronounced as the link gain c 2 i increases. The improvement is 
mainly because our coding scheme allows receiver 1 to decode partial information from sender 
2, and thus reduces the effective interference experienced by receiver 1. In addition, it can be 
seen from the figures that in the high-interference-gain regime, Q sp i is not convex and thus is 
only suboptimal. 

VI. Conclusions 

In this paper, we have investigated the IC-DMS from an information theoretic perspective. 
We have developed a coding scheme that combines the advantages of cooperative coding, 
collaborative coding and Gel'fand-Pinsker coding. With the coding scheme, we have derived 
a new achievable rate region for such a channel, which not only includes existing results as 
special cases, but also exceeds them in the high-interference-gain regime. However, we are not 
able to establish a converse for the derived achievable rate region, because the achievable result 
is closely related to the achievable results for the interference channel and the broadcast channel, 
for which there is no converse available in general. 

Appendix 
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1.5 




Fig. 8. Pi = Pi = 6, C21 = 6, C12 = 0.3. (i) gives the rate region in Corollary 3 (equivalently, Theorem 4.1 of [2] and 
Theorem 3.5 of [3]); (ii) gives the achievable rate region in Corollary 4; (iii) gives the achievable rate region in Theorem 5. 



An Achievable Rate Region for the GIC-DMS 



In this appendix, we show how to extend 71, the achievable rate region for the discrete 
memoryless IC-DMS, to its Gaussian counterpart, Q. Note that the mappings M1-M6 of the 
auxiliary random variables are described in Section |V] We first compute the following two 
covariance matrices: 



/^ll ^12 ^13 
A*21 ^22 A*23 
A*31 A*32 ^33 



E{W 2 } E{WU} E{WY l ) 
E{WU} E{U 2 } E{UY X ) 
E{WY 1 } E{UYx} E{Y?} 



Pi XxPi 
AiPx a(3P 2 + A?Pi 

T]i^/P[ \irjx\ZP~i + Jo^a(3P 2 



Ai?7r 



fpi 



Tl + y/c^apP 2 
+ c 21 aP 2 + 1 
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^11 ^12 ^13 
^21 ^22 ^23 
^31 ^32 ^33 

a(3P 2 + AfPi 
A1A2-P1 



a(3P 2 + X^VPi a(3P 2 + X2V2VP1 



E{U 2 } E{UV} E{UY 2 } 
E{UV} E{V 2 } E{VY 2 } 
E{UY 2 } E{VY 2 } E{Y 2 2 } 

af3P 2 + \ir]2\^ 
a(3P 2 + \ 2 r) 2 ^/P[ 
aP 2 + 77I + 1 



A1A2A 
atPP 2 + XlPi 



where 



Vi = y/K + V c 2 iaP 2 , 
7/2 = y/aP 2 + y/c 12 Pi, 

and E{-} denotes the expectation of a random variable. 

Define T(x) = log 2 (x)/2, and £ = log 2 (27re)/2. We express the respective differential entropy 
terms as: 



h a = h(W)=£ + r(jji 11 ), 
h b = hiUYr) = 2£ + T 

h c = hiWUY^ = 3£ + r 

h d = h(uv) = 2^ + r 

h e = h(Y 2 )=Z + T(v 33 ), 
h f = h{UVY 2 ) = 3£ + T 

^ s = /i(w) = 2^ + r 

h h = h{Y 1 )=Z + r{»u), 

h i = h(y) = t + r(v 22 ), 

hj = h{UY 2 ) = 2£ + T 
h k = h(U) = £ + r(r/ u 
fc, = h(VY 2 ) = 2£ + r 



( 


M22 


A*23 




y"32 


A*33 



^11 ^12 /^13 
/^21 /^22 /^23 
^31 ^32 ^33 



( 




^12 




^21 


^22 



^11 ^12 ^13 
^21 ^22 ^23 
^31 ^32 ^33 



( 


/ill 


/il2 




/i21 


/i22 



( 


/ill 


AH3 


)• 


/i31 


/U33 


( 


/i22 


At23 




A*32 


/i33 



where I • I denotes the determinant of a matrix. 
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The mutual information terms in ©-© are then computed as: 

h = h a + h b - h c , 
h = h d + h e - h f , 



a/3P 2 ' 

rfi + 



a(3P 2 ' 
h — h g + hh — h c , 
I 6 = hi + hj - hf, 
h = h k + hi - h f . 

Let Q(a, /?, Ai, A 2 ) denote the set of all rate pairs (R±, R 2 ) such that the following inequalities 
are satisfied: 

Ri<h, (65) 
Ri<h-h- U, (66) 
Ri + R 2 <h + h-h- h; (67) 
< h ~ h, (68) 
< h - h, (69) 
< h - h, (70) 
</ 2 -/ 3 -j 4 . (71) 

for given a,/3 6 [0,1] and Ai,A 2 G [0, +oo). Note that (|651)-(|7T1) are directly extended from 
©-©. 

Theorem 5: The rate region Q is achievable for the GIC-DMS in the standard form with 

Q= |J A 1; A 2 ). 

a,/3e[0,l];Ai,A 2 G[0,+oo) 
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